Similar resources
Recognizing Uncertainty in Speech
We address the problem of inferring a speaker’s level of certainty based on prosodic information in the speech signal, which has application in speech-based dialogue systems. We show that using phrase-level prosodic features centered around the phrases causing uncertainty, in addition to utterance-level prosodic features, improves our model’s level of certainty classification. In addition, our ...
Recognizing emotion in speech
This paper explores several statistical pattern recognition techniques to classify utterances according to their emotional content. We have recorded a corpus containing emotional speech with over 1,000 utterances from different speakers. We present a new method of extracting prosodic features from speech, based on a smoothing spline approximation of the pitch contour. To make maximal use of th...
Recognizing Sloppy Speech
As speech recognition moves from labs into the real world, the sloppy speech problem emerges as a major challenge. Sloppy speech, or conversational speech, refers to the speaking style people typically use in daily conversations. The recognition error rate for sloppy speech has been found to double that of read speech in many circumstances. Previous work on sloppy speech has focused on modeling...
Recognizing Speech from Sim
In this paper we present and evaluate factored methods for recognition of simultaneous speech from multiple speakers in single-channel recordings. Factored methods decompose the problem of jointly recognizing the speech from each of the speakers by separately recognizing the speech from each speaker. In order to achieve this, the signal components of the target speaker in each case must be enha...
Understanding Speech without Recognizing Words
This paper describes a system to exploit non-lexical acoustic cues to listener comprehension in a dialog between a human and a computer. The computer uses text-to-speech synthesis to recite a series of driving directions. It classifies the listener's responses as affirmative or negative based on duration, pitch, and energy; this is used to control flow of the conversation to facilitate the lis...
Journal
Journal title: EURASIP Journal on Advances in Signal Processing
Year: 2010
ISSN: 1687-6180
DOI: 10.1155/2011/251753